Data Fusion and Object Identification
نویسندگان
چکیده
Many databases exist, which are defined on the same universe of objects, but there is no global ID available at all. To fuse data it is necessary to identify objects. As an example think of several population databases, where only name and address are common attributes. Data quality varies over the sources. Therefore the identification is more complicated due to erroneous and obsolete data. The usual approach of data integration is to establish a global schema, which is derived from the local schemata. In the case of missing ID’s the global schema is not sufficient for identification. Hence we compute appropriate derivable attributes. Records are compared pairwise based on those derived attributes. The underlying comparison space is partitoned into at least two classes same and not same. To classify records according to these comparison values, supervised learning can be applied. Data Fusion is more than integration because it merges data and evaluates the quality of the merging. We develop a unified framework for such kind of data fusion. We cover the feature selection problem, and embed the data fusion problem into supervised classification. This approach makes it possible to select the best procedure for data fusion and object identification from solid methodologies like likelihood ratio tests (record linkage), classification trees or non linear classification. Unfortunately, for supervised learning a random sample of pairs must be taken and each pair labeled. Alternatively, a weighted similarity measure can be defined and used for classification. Referring to modern theory of distributed, heterogeneous databases we add a third component to the three-layer Mediator-Wrapper architecture of data integration introduced by Wiederhold: The Identificator. We illustrate the approach by an example.
منابع مشابه
Urban Vegetation Recognition Based on the Decision Level Fusion of Hyperspectral and Lidar Data
Introduction: Information about vegetation cover and their health has always been interesting to ecologists due to its importance in terms of habitat, energy production and other important characteristics of plants on the earth planet. Nowadays, developments in remote sensing technologies caused more remotely sensed data accessible to researchers. The combination of these data improves the obje...
متن کاملObject-Based Classification of UltraCamD Imagery for Identification of Tree Species in the Mixed Planted Forest
This study is a contribution to assess the high resolution digital aerial imagery for semi-automatic analysis of tree species identification. To maximize the benefit of such data, the object-based classification was conducted in a mixed forest plantation. Two subsets of an UltraCam D image were geometrically corrected using aero-triangulation method. Some appropriate transformations were perfor...
متن کاملApplication of Combined Local Object Based Features and Cluster Fusion for the Behaviors Recognition and Detection of Abnormal Behaviors
In this paper, we propose a novel framework for behaviors recognition and detection of certain types of abnormal behaviors, capable of achieving high detection rates on a variety of real-life scenes. The new proposed approach here is a combination of the location based methods and the object based ones. First, a novel approach is formulated to use optical flow and binary motion video as the loc...
متن کاملDetecting Surface Waters Using Data Fusion of Optical and Radar Remote Sensing Sensor
Identification and monitoring of surface water using remote sensing have become very important in recent decades due to its importance in human needs and political decisions. Therefore, surface water has been studied using remote sensing systems and Sentinel-1 and Sentinel-2 sensors in this study. In this paper, two data fusion approaches and decision fusion improve the accuracy of surface wate...
متن کاملObject Level Strategy for Spectral Quality Assessment of High Resolution Pan-sharpen Images
Panchromatic and multi-spectral images produced by the remote sensing satellites are fused together to provide a multi-spectral image with a high spatial resolution at the same time. The spectral quality of the fused images is very important because the quality of a large number of remote sensing products depends on it. Due to the importance of the spectral quality of the fused images, its eval...
متن کاملHybrid method for full identification of buried objects and surrounding media
This study describes a hybrid technique for identification of buried objects. The objects shape and electromagnetic profile are reconstructed from evaluations of electrical permittivity and conductivity. The method suggests a combination of linear sampling and optimization. Linear sampling method (LSM) is used to recover shape and metaheuristic optimizations essential to reconstruct the inside...
متن کامل